# GUI agent
Internvl3 8B
Apache-2.0
InternVL3 - 8B is an advanced multimodal large - language model with excellent multimodal perception and reasoning capabilities, capable of processing multimodal data such as images and videos.
Multimodal Alignment
Transformers

I
unsloth
224
1
Internvl3 1B GGUF
Apache-2.0
InternVL3 - 1B is an advanced multimodal large language model that excels in multimodal perception, reasoning, and other abilities. It also expands multimodal capabilities such as tool use and GUI agent.
Multimodal Fusion
Transformers

I
unsloth
868
2
Internvl3 14B Hf
Other
InternVL3-14B is a powerful multimodal large language model that excels in multimodal perception and reasoning abilities and supports multiple inputs such as images, texts, and videos.
Image-to-Text
Transformers Other

I
OpenGVLab
4,260
0
Internvl3 8B
Other
InternVL3-8B is an advanced multimodal large language model with excellent multimodal perception and reasoning capabilities, and performs well in multiple fields such as tool use, GUI agents, and industrial image analysis.
Multimodal Fusion
Transformers Other

I
FriendliAI
167
0
Featured Recommended AI Models